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In the present study we extended errorless learning to a conditional temporal discrimination. Pigeons’ 
responses to a left-red key after a 2-s sample and to a right-green key after a 10-s sample were 
reinforced. There were two groups: One learned the discrimination through trial and error and the 
other through an errorless learning procedure. Then, both groups were presented with three types of 
tests. First, they were exposed to intermediate durations between 2 s and 10 s, and given a choice 
between both keys (stimulus generalization test). Second, a delay from 1 s to 16 s was included between 
the offset of the sample and the onset of the choice keys (delay test). Finally, pigeons learned a new 
discrimination in which the stimuli were switched (reversal test). Results showed that pigeons from the 
Errorless group made significantly fewer errors than those in the Trial-and-Error group. Both groups 
performed similarly during the stimulus generalization test and the reversal test, but results of the delay 
test suggested that, on long stimulus trials, responding in the errorless training group was less disrupted 
by delays. 
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When experimental psychologists train dis¬ 
criminations, in addition to reinforcing cor¬ 
rect responses they typically allow errors (that 
is, incorrect responses) to occur and be 
extinguished. Before the 1960s it was believed 
that discrimination learning required the 
selection of both correct and incorrect choices 
so that the consequences of each could be 
experienced (Hull, 1950; Spence, 1936). How¬ 
ever, Terrace (1963a) developed a new meth¬ 
od, later termed errorless learning, which allows 
learning to occur with few or no responses to 
the incorrect choice. In his first experiment, 
pigeons learned a simple successive discrimi¬ 
nation between a red key (S+) and a green key 
(S—). Initially the red key was presented with 
full brightness and duration, whereas the 
green key was dark and presented only for a 
short duration. Gradually, the brightness and 
duration of the S— increased until the stimuli 
differed only in hue {Jading) . The main result 
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showed that the group trained with the fading 
procedure gave few or no responses to the S—. 

Terrace’s subsequent studies with animals 
(Terrace, 1963b,c; 1964; 1966a; 1969; 1971; see 
also Terrace, 1966b, 1972) and humans 
(Terrace, 1974) confirmed and extended his 
original findings. He found that subjects 
exposed to errorless training performed dif¬ 
ferently in a variety of tests than subjects 
exposed to trial-and-error training. Specifical¬ 
ly, subjects trained with the fading procedure 
did not show explicit “emotional” responses, 
such as wing flapping and striking the key, in 
the presence of the S—. Perhaps because these 
subjects did not experience nonreinforcement 
from responding to the S—, the S— did not 
become an aversive stimulus and therefore 
escape and antagonistic responses in its 
presence did not occur. Terrace also found 
that the generalization gradients for the 
errorless discrimination group did not show 
peaks shifted away from the S— (i.e., no peak 
shift) or with a minimum at the S— (i.e., no 
inhibitory stimulus control), did not increase 
response rate or decreased response latency to 
the S+ (i.e., absence of behavioral contrast), 
and did not show a disruption in their 
performance when sedatives (chlorpromazine 
and imipramine) were administered. Terrace 
(1972) classified these behavioral phenomena, 
which occurred only following a discrimina¬ 
tion learned through trial and error, as by¬ 
products of discrimination learning. 
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Terrace’s findings had a significant impact 
and generated a considerable number of 
follow-up studies, both basic and applied 
(e.g., basic: Arantes & Berg, 2009; Lyons, 
1969a,b; Mackay & Brown, 1971; Moore & 
Goldiamond, 1964; Rilling & Caplan, 1975; 
Robinson, Foster & Bridges, 1976; Schuster- 
man, 1966; Sidman & Rosenberger, 1967; 
Wessells, 1973; applied: Barlow & Agras, 
1973; Birkan, McClannahan & Krantz, 2007; 
Corey & Shamow, 1972; Pace, Iwata, Cowdery, 
Andree & McIntyre, 1993; Sidman & Stoddard, 
1967; Touchette & Howard, 1984). The strong 
interest shown by applied researchers for 
errorless learning has continued until the 
present because reducing errors and their 
associated costs is beneficial in a variety of 
settings (e.g., interventions with children and 
adults with developmental disabilities). How¬ 
ever, interest among basic researchers de¬ 
clined after the 1970s, which may have been 
due to criticisms of Terrace’s original studies 
raised by some researchers (e.g., Kodera & 
Rilling, 1976; Lyons, 1969a,b; Marsh & John¬ 
son, 1968; Rilling, Kramer & Richards, 1973; 
Wessells, 1973). Specifically, these studies 
found that some of the by-products of discrim¬ 
ination learning have little relationship to the 
occurrence or nonoccurrence of errors during 
S— (see Rilling, 1977, for review). 

One limitation of errorless learning proce¬ 
dures was that they applied only to a restricted 
range of discriminations, typically simple 
discriminations in the visual modality. Some 
researchers (e.g., Gollin & Savoy, 1968; Gur- 
alnick, 1975; Schilmoeller, Schilmoeller, Etzel 
& LeBlanc, 1979) unsuccessfully tried to teach 
conditional discriminations to children based 
on simple discriminations learned through 
fading procedures. Others were only partly 
successful: Schilmoeller et al. were able to 
teach an errorless conditional discrimination 
to participants after teaching them two simple 
discriminations, not through fading but stim¬ 
ulus shaping, a technique that involves manip¬ 
ulating the topographical configuration of 
visual stimuli. To illustrate, in one condition 
of Schilmoeller et al.’s study, the S+ was a 
triangle with a single stripe background and 
the S— was a circle with a single stripe 
background, both presented on a card. Initial¬ 
ly, the S+ was manipulated to look like a tree 
on a hill, while the S— was a blank white card. 
Then, the shape of the S~ was changed from 


“an apple with a worm” to the circle with a 
single stripe background while the S+ re¬ 
mained constant as the “tree on a hill”. 
Finally the S+ changed from the “tree on a 
hill” to the triangle with a single stripe 
background. Results showed that 12 out of 
the 16 subjects made none or very few 
responses (S 8% incorrect responses) during 
the learning of the conditional discrimination 
between the final S+ and S— (i.e., triangle vs. 
circle with a single stripe background). 

Stubbs (1968) adapted a fading procedure 
to teach a conditional temporal discrimination 
in which responses to a red key were rein¬ 
forced if the preceding sample was short (from 
1 s to 5 s), whereas responses to a green key 
were reinforced if the preceding sample was 
long (from 6 s to 10 s). Group 1 (N = 3 
pigeons) was exposed to all sample durations 
from the beginning of training. The S+ was a 
full red key and the S— was a dark green that 
gradually became brighter such that at the end 
of the training both keys varied only in terms 
of hue. Group 2 (N = 3 pigeons) was exposed 
initially only to the two extreme sample 
durations (i.e., 1 s and 10 s); the other 
durations were introduced after performance 
stabilized. Although Stubbs stated that both 
groups learned through fading procedures, 
Group 2 actually acquired the discriminations 
through trial-and-error in two different phases, 
first with the extreme sample durations and 
then with the other eight intermediate dura¬ 
tions. Because Stubbs’ goal was not to study 
errorless learning per se, but the psychophysical 
relationship between sample durations and 
choice responding, he did not report the 
number of errors made by the pigeons during 
training. However, it seems that performance 
after fading (Group 1) was not particularly 
good. First, after being exposed to the fading 
procedure for more than 45 sessions, the 
pigeons from Group 1 had to be given 
additional sessions (M = 89) until their 
performance stabilized. Second, his Figure 1 
suggests that the percentage of correct re¬ 
sponses to the 10-s sample (longest duration) 
was lower in Group 1 than in Group 2 (range 
from Group 1: 75% to 90%; range from Group 
2: 85% to 93%). 

Because there are not enough data to make 
conclusions about the effectiveness of the 
fading procedure in training a temporal 
discrimination, the present study aimed to 
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answer this question. In a matching-to-sample 
task, pigeons learned to choose a red left key 
after a 2-s sample and a green right key after a 
10-s sample. For one group of pigeons, both 
keys were lit simultaneously after the sample 
and no attempt was made to prevent errors 
(Trial-and-Error group). For another group of 
pigeons, a novel errorless learning procedure 
was used (Errorless group). 

The errorless procedure described below is 
based on the concept of mediating behavior. 
Some theories of timing (e.g., the Behavioral 
theory of Timing, BeT; the Learning-to-Time 
model, LeT) hypothesize that temporal dis¬ 
crimination is affected by mediating behavior¬ 
al states, and a number of studies have 
supported the hypothesis that, at least in some 
tasks, subjects use their own behavior to 
discriminate sample durations (e.g., Fetter- 
man, Killeen & Flail, 1998; Machado & 
Arantes, 2006). For example, a pigeon ex¬ 
posed to a discrimination between 2-s and 10-s 
samples presented on a center key might start 
the trial facing the front panel, then approach 
the center key and peck it once or twice; if the 
signal persists because the sample is long, the 
pigeon may then peck the center key at a 
higher rate. Differences in the ongoing behav¬ 
ior at the moment of choice correlate with, 
and thus may mediate, choice performance. 

In our experiment, we used a fading 
procedure to establish a pattern of behavior 
that should facilitate the acquisition of a 
temporal discrimination. If successful, the 
fading procedure should train the discrimina¬ 
tion with few or no errors. Specifically, we 
trained one group of pigeons to face the 
intelligence panel at trial onset, stand for a few 
seconds in front of the left key (which later 
would be the correct comparison stimulus 
following the short sample), and then move 
to the right key (later the correct comparison 
stimulus following the long sample) and stand 
in front of that key until the end of the trial. 
Initially, the positive stimulus (left red or right 
green key, depending on the duration of the 
sample) also was lit during the sample. After 
the pigeons responded reliably, we introduced 
a delay between the illumination of the sample 
and the positive stimulus. When the delay 
reached 100%, the positive stimulus was lit 
only after the sample had elapsed (2 s or 10 s). 
Next, on an increasing percentage of trials the 
negative stimulus also was lit together with the 


positive stimulus. Finally, both comparison 
stimuli were always lit simultaneously after 
the sample. We expected that the pigeons 
would learn the temporal discrimination with 
few or no errors. 

To determine if there were behavioral 
differences between subjects that learned the 
task via trial-and-error or errorless learning, we 
compared the performance of the two groups, 
henceforth named Errorless group and Trial- 
and-Error group, on three types of tests. First, 
the pigeons were exposed to sample durations 
ranging from 2 s to 10 s and given a choice 
between the two comparisons (stimulus gen¬ 
eralization test). We were interested in deter¬ 
mining not only whether the generalization 
gradients differed, but also whether the 
mediating behavior of the Errorless group, 
established in the pretraining, predicted the 
results from the generalization test. 

Second, the pigeons were exposed to a delay 
test in which blackouts of 1 s to 16 s were 
interposed between the offset of the sample 
and the onset of the comparison stimuli. The 
goal was to determine whether responding 
acquired through trial-and-error or errorless 
learning would be differentially resistant to 
disruption. 

Third and finally, the pigeons learned a new 
discrimination in which the comparison stim¬ 
uli were switched, that is, a peck on the left red 
key now produced food after the 10-s samples, 
whereas a peck on the right green key 
produced food after the 2-s samples (reversal 
test). Marsh and Johnson (1968) examined 
reversal learning following the acquisition 
through a fading procedure of a successive 
simple discrimination between a red (S+) and 
a green key (S—). The results were mostly 
negative: 4 of their 5 pigeons made fewer than 
two responses to the new S+ (which was the S— 
during the previous discrimination) during 
five test sessions. Based on this result, Rilling 
(1977) concluded that “errorless learning is 
clearly not the best learning for an organism 
exposed to a changing environment”. Howev¬ 
er, other studies have shown appropriate 
reversal following errorless learning (e.g., 
Aronsohn, Castillo & Pinto-Hamuy, 1978; 
Robinson & Storm, 1978; Schusterman, 1965, 
1966; Stettner & Matyniak, 1969). 

In our experiment, both comparison stimuli 
were associated with reward and nonreward, 
albeit in different (sample) contexts. There- 
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fore, we wanted to determine whether the 
Errorless and Trial-and-Error pigeons differed, 
when later exposed to the reversed contingen¬ 
cies, in their speed of detecting the new 
response-reinforcer relationship. 

METHOD 

Subjects 

Twelve pigeons ( Columba livia ) were main¬ 
tained at 85% of their free-feeding weights 
(± 15 g) through appropriate postsession 
feeding. Pigeons were housed individually 
in a vivarium with a 12h/12h light/dark 
cycle (lights on at 07:00 am). Eight pigeons 
were experimentally naive and the remain¬ 
ing 4 were experienced with choice proce¬ 
dures (concurrent schedules), but had not 
been exposed to timing-related tasks. Water 
and grit were freely available in the home 
cages. 

Apparatus 

Eight standard three-key operant chambers, 
35 cm deep by 36 cm wide by 35 cm high, were 
used. The keys were arranged in a row, 26 cm 
above the floor. The side keys could be 
illuminated from behind with red or green 
light. In each chamber there was a houselight 
located 7 cm above the center key and a grain 
magazine with a 6-cm by 5-cm aperture located 
13 cm below the center key. The magazine was 
illuminated when wheat was made available. A 
force of approximately 0.10 N was necessary to 
operate each key, and effective responses 
produced an audible feedback click. Cham¬ 
bers were enclosed in a sound-attenuating box, 
and ventilation and white noise were provided 
by an attached fan. Event scheduling and data 
recording were controlled with a MEDSTATE® 
notation program and a MED-PC® system 
interfaced to an IBM®-compatible microcom¬ 
puter that was located in an adjacent room. 

Procedure 

Sessions were conducted 7 days a week at 
approximately the same time of day. The 
pigeons were divided into two groups, an 
Errorless group (Pigeons 201, 202, 207, 208, 
227 and 228), and a Trial-and-Error group 
(Pigeons 203, 204, 205, 206, 225 and 226). 
Four pigeons from each group were experi¬ 
mentally naive (Pigeons 201-208), and thus 


learned first to peck the keys using an 
autoshaping procedure. The remaining 2 
birds of each group (Pigeons 225-228) were 
exposed directly to the experimental proce¬ 
dure. Because of a programming error, during 
the fading of the multiple FI 2 s FI 10 s for 
Pigeon 207, the percentage delay between the 
illumination of the houselight (sample) and 
the response key changed abruptly from 10% 
to 80%. Therefore, results from Pigeon 207 
were excluded from analysis. 

Pretraining - Errorless Group 

The pretraining, which pertains only to the 
Errorless group, was composed of three 
phases. Table 1 summarizes the main features 
and number of sessions in each phase. The 
main goal of Phases I and II was to train 
mediating behaviors that would facilitate 
acquisition of the temporal discrimination. 
Specifically, we wanted the pigeons to stand in 
front of the left key at the beginning of each 
trial, and switch to standing in front of the 
right key after several seconds had elapsed. To 
that end, during Phases I and II, only one key 
was illuminated on each trial (e.g., the left key 
on 2-s trials and the right key on 10-s trials). 
The duration of the houselight (2 s or 10 s) 
was the sample. The difference between both 
phases was that whereas in Phase I the 
response key and the houselight were illumi¬ 
nated simultaneously, in Phase II the response 
key was illuminated only after a delay that 
increased gradually such that, in the end, the 
key was illuminated only when the houselight 
was turned off. In Phase III, discrimination 
trials were introduced gradually, that is, on 
some trials the left and the right keys were 
turned on simultaneously after the sample, 
and a response to either key ended the trial 
with reinforcement. Finally, in the baseline 
procedure all trials were (the usual) choice 
trials: Following the sample stimulus, the two 
comparison stimuli were presented. 

Phase I: Multiple FI 2 s FI 10 s. Pigeons were 
initially exposed to a multiple FI 2 s FI 4 s 
schedule; then, across several sessions, the 
interval of the longer schedule was increased 
gradually to 10 s (i.e., multiple FI 2 s FI 10 s). 
Sessions lasted for 60 trials. Each trial was 
preceded by a 30-s ITI during which all the 
lights were off. After the ITI, the houselight was 
turned on and a side key was illuminated (red 
or green, depending on the sample duration). 
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Table 1 


Main features of Phases I, II and III of the Pretraining, which pertain only to the Errorless group. 


Phase 

Sample duration (s) 

Delay (% of the 
FI value) 

Number of 
discrimination trials 

Number of 
sessions (average) 

left key 1 

right key 1 

Phase I 

2 

4 

0 

0 

3 


2 

6 

0 

0 

3 


2 

8 

0 

0 

3 


2 

10 

0 

0 

5 

Phase II 

2 

10 

10 

0 

3 


2 

10 

20 

0 

3 


2 

10 

30 

0 

3 


2 

10 

40 

0 

3 


2 

10 

50 

0 

3 


2 

10 

60 

0 

3 


2 

10 

70 

0 

3 


2 

10 

80 

0 

3 


2 

10 

90 

0 

3 


2 

10 

100 

0 

5 

Phase III 

2 

10 

100 

o 2 

3 


2 

10 

100 

12 

1 


2 

10 

100 

36 

1 


1 Counterbalanced across birds 

2 A peck on the response key initiated a 0.5 s delay during which both S + and S were presented, followed by food 
presentation and then the ITI. 


Although the assignment of the red and green 
colors to the two sample durations and to the 
left and right keys was counterbalanced across 
birds, for clarity we describe the procedure as if 
all birds were trained as follows: On half of the 
trials, the schedule in effect was the FI 2 s and 
the left key was illuminated red; on the 
remaining trials, the schedule in effect was the 
FI 10 s and the right key was illuminated green. 
When the FI duration elapsed, the houselight 
was turned off and the first response to the 
illuminated key was reinforced. During rein¬ 
forcement, the keylight was turned off and the 
grain hopper was raised and illuminated for 3 s. 
If the pigeon did not respond within 5 s after 
the houselight was turned off, the trial was 
canceled. Training in this phase continued 
until response rates appeared approximately 
stable in both components for all pigeons, and 
it lasted 14 sessions on average. 

Phase II: Fading of the multiple FI 2 s FI 10 s. 
The multiple FI 2-s FI 10-s schedule was 
identical to Phase I, with the following 
exception. Whereas in Phase I the response 
key was illuminated simultaneously with the 
houselight, in Phase II the response key was 
illuminated only after a delay. The delay was 
always a percentage of the FI schedule in effect 
and increased gradually from 10% to 100% in 
steps of 10% across blocks of sessions. Each 


fading value was maintained for at least three 
sessions, and until the response rates appeared 
stable. When the delay had reached 100% of 
the FI value, trials proceeded as follows: The 
houselight was illuminated for either 2 s or 
10 s, and then it was turned off as the 
corresponding side key was illuminated with 
either red or green light. The first response to 
the illuminated key darkened the key and 
produced reinforcement. This phase lasted on 
average 32 sessions. 

Phase III: Gradual introduction of the S—. This 
phase lasted five sessions and consisted of two 
steps. During the first three sessions, the first 
response to the sole lit key initiated a 0.5-s 
delay during which both S+ and S— were 
presented, followed by reinforcement. The 
goal of this step was to expose the pigeons to 
the S— in a context in which they would not be 
able to respond to it. In the second step, over 
the next two sessions, the pigeons were 
exposed to choice trials in which both S+ 
and S— were illuminated immediately after the 
houselight was turned off. In the first session, 
there were 12 choice trials (6 for each sample 
duration) randomly intermixed with 48 only 
S+ trials. In the second session, there were 36 
choice trials (18 for each sample duration) 
randomly intermixed with 24 only S+ trials. 
During choice trials, after the sample duration 
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had elapsed, the houselight was turned off and 
the left and the right side keys were illuminat¬ 
ed red and green, respectively. A peck to 
either key turned all lights off. If the choice 
was correct (i.e., left-red key after a 2-s sample 
or right-green key after a 10-s sample), the 
hopper was activated and the ITI followed; if 
the choice was incorrect, the ITI began 
immediately. To reduce the number of poten¬ 
tial errors (i.e., responses to S—) that pigeons 
could make during the gradual introduction 
of the choice trials, if pigeons made a response 
to the dark key that corresponded to S— after 
at least 80% of the sample duration had 
elapsed, the trial was cancelled by turning off 
the houselight and starting the ITI. 

Baseline Training - Errorless Group 

Sixty choice trials comprised each session 
and were identical to those of Phase III of 
pretraining for the Errorless group (see above). 
Baseline training continued until pigeons 
reached the criterion of 85% correct responses 
for each sample over three consecutive sessions. 

Baseline Training - Trial-and-Error Group 

The Trial-and-Error pigeons started the 
experiment in the Baseline condition. This 
condition was similar to that for the Errorless 
group, except that pecks on a dark key did not 
cancel the trial. Baseline training continued 
until the same criterion of 85% correct 
responses for each sample over three consec¬ 
utive sessions was reached. 

Testing 

After birds had reached criterion, they were 
exposed to three different tests: stimulus 
generalization test, delay test and reversal test. 
Contingencies during all test sessions were the 
same for both groups. 

Stimulus generalization test. Each session 
consisted of 30 test trials randomly intermixed 
with 30 choice trials identical to those used in 
baseline for the Trial-and-Error group. Sample 
durations on test trials were sampled from a set 
of three logarithmically-spaced values, in 
which the middle duration corresponded to 
the geometric mean of the training durations: 
3.0 s, 4.5 s, and 6.7 s. Each of these test 
durations occurred 10 times per session. 
Because we wanted the Errorless group to 
experience as little extinction as possible 


throughout the experiment, responses on test 
trials were always nondifferentially reinforced 
for both groups. The stimulus generalization 
test lasted for three sessions. 

Delay test. After the stimulus generalization 
test, pigeons were given three additional 
sessions of baseline training prior to the delay 
test. During the delay test, each session lasted 
for 60 trials. These were identical to choice 
trials from baseline training except that after 
the sample duration had elapsed and the 
houselight was extinguished, the side keys were 
lit simultaneously following a delay. Five differ¬ 
ent delays were arranged in each session: 1 s, 2 s, 
4 s, 8 s, and 16 s. Each delay occurred 12 times 
per session, 6 for each of the two sample 
durations. During the delay, the chamber was 
dark. After the side keys were lit, a response to 
the correct key produced reinforcement. The 
delay test lasted 20 sessions. 

Reversal test. After the delay test, pigeons 
returned to baseline for three sessions. During 
the reversal test the procedure was similar to 
baseline except that the contingencies were 
reversed. To receive food, pigeons needed to 
choose the right-green key after the 2-s 
samples and the left-red key after the 10-s 
samples. Training continued until the pigeons 
reached an accuracy criterion of at least 85% 
correct responses following each sample for 
three consecutive sessions. 

RESULTS 

Pretraining - Errorless Group 

Phases I and II: Multiple FI 2 s FI 10 s and 
fading of the multiple FI 2 s FI 10 s. Our first 
question was whether the fading procedure 
established the desired mediating behavior for 
the Errorless group. Figures 1 and 2 address 
this issue. Figure 1 shows the average number 
of responses to the correct key (i.e., left-red 
key on 2-s trials and right-green key on 10-s 
trials) made on each trial while the sample was 
on. The data come from the last five sessions 
in Phase I (i.e., prior to the fading of the 
multiple FI 2 s FI 10 s), for each block of three 
sessions in Phase 2 (i.e., during the fading 
procedure), and for the last five sessions in 
Phase II (i.e., after the fading procedure). 
Responses were counted regardless of whether 
or not the key was lit. Figure 1 shows that, as 
the duration that the key was illuminated 
decreased across sessions, responses on both 
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Fig. 1. Average number of key pecks (to the correct key) emitted per trial throughout the sample duration over 
Phase I and Phase II. 


2-s and 10-s trials also decreased. This result 
suggests that the onset of the keylights evoked 
responding. However, all birds continued to 
respond to the dark keys to some extent 
during the 10-s trials in the 100% fading 
sessions (i.e., when the two keys were turned 
off during the houselight and only the right- 
green key was illuminated afterwards). 

Figure 2 shows how the fading procedure 
affected the distribution of dark-key responses. 
The total number of pecks to the left and right 


keys when they were dark, emitted per session 
during the 10-s trials and averaged across the 
last five sessions of Phase II (i.e., 100% fading), 
is plotted as a function of elapsed duration. 
Temporal control over responding is evident. 
All pigeons responded more to the 2-s key at 
the start of the houselight, with responding 
peaking at approximately 2 to 3 s after house- 
light onset. Responding to the 10-s key then 
increased systematically for the remainder of 
the sample duration. This pattern was evident 
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Sample duration (s) 

Fig. 2. Total number of key pecks emitted per session during the last sessions of Phase II over the 10-s trials. Solid and 
dotted lines correspond to the left and right keys, respectively. 


for all pigeons, with exception of Pigeon 227, 
which rarely pecked the 10-s key. These results 
suggest that the fading procedure successfully 
established the mediating behavior for the 
Errorless group that should facilitate the 
acquisition of the temporal discrimination. 

The presence of a well-differentiated pattern 
during the sample also supported the follow¬ 
ing hypothesis: If the pattern remained stable 
in subsequent phases, then the moment the 
pigeons switched between the two dark keys 
(i.e., ceased to peck the left key and started to 
peck the right key) should predict the bisec¬ 


tion point when they were exposed to the 
stimulus generalization test trials. We return to 
this issue below. 

Phase III: Gradual introduction of the S—. 
During the two sessions in which the pigeons were 
exposed to choice trials for the first time (total of 
48 trials), the Errorless group made an average of 
46 correct responses (95.8%), 1 incorrect response 
(2.1%), 0.8 cancelled trials (1.7%), and virtually 
no missed trials (0.4%). Pigeons 202 and 227 did 
not make any errors, 1 pigeon (228) made only 
one error, and the remaining 2 pigeons (201 and 
208) made three errors. 
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Fig. 3. Number of responses to the S— made by each bird during all choice trials until they reached the 
learning criterion. 


Baseline Training 

When exposed to sessions consisting of 
exclusively choice trials, the Errorless group 
made an average of 3 errors (Pigeons 201, 202, 
208, 227 and 228 made 4, 5, 6, 0 and 0, 
respectively) and the Trial-and-Error group 
made an average of 52 errors (Pigeons 203, 
204, 205, 206, 225 and 226 made 47, 68, 44, 44, 
56 and 52 errors, respectively) until they 
reached the criterion of at least 85% correct 
responses over three consecutive sessions. 
Whereas pigeons in the Errorless group met 
the 85% criterion in all discrimination ses¬ 
sions, and therefore reached the criterion in 
the minimum number of 3 sessions (180 
trials), the Trial-and-Error group met the 
criterion only after an average of 6.5 sessions 
(390 trials). An independent-means t test 
comparing the number of baseline sessions 
to acquire the discrimination showed that the 
Errorless group needed significantly fewer 
sessions to reached the criterion than the 
Trial-and-Error group, t( 9) = —4.40, p < .01. 
These results show that the mediating behav¬ 
ior facilitated the acquisition of the temporal 
discrimination. 


Next we compared performance of the two 
groups during all choice trials. For the Errorless 
group, this included the 48 trials presented in 
Phase III of pretraining and the 180 trials 
presented in baseline, for a total of 228 trials; 
for the Trial-and-Error group, this included 380 
trials on average (range: 300 to 600 trials). 
Figure 3 shows the number of responses to the 
S— (errors) made by each pigeon. Whereas 
pigeons in the Errorless group made an average 
of 4 errors (98% correct responses; Pigeons 
201, 202, 208, 227 and 228 made 5, 5, 9, 0 and 1 
errors, respectively), those in the Trial-and- 
Error group made an average of 52 errors (86% 
correct responses; Pigeons 203, 204, 205, 206, 
225 and 226 made 47, 68, 44, 44, 56 and 52 
errors, respectively). It is important to note that 
the 2 experienced birds in the Errorless group 
(Pigeons 227 and 228) made 0 and 1 error, 
respectively. An ANOVA with sample duration 
and group as factors found a significant effect 
of group, T(l,9) = 117.78, p < .001, but no 
effect of sample duration, or of their interac¬ 
tion. Although the contingency for cancelling 
trials remained in place during baseline train¬ 
ing for the Errorless group, virtually no trials 
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were cancelled (0.4%). Specifically, with the 
exception of Pigeons 202 and 208, which 
throughout 180 choice trials had one and three 
cancelled trials, respectively, none of the other 
pigeons had cancelled trials. 

Performance during the last three sessions 
of baseline training did not differ appreciably 
between the two groups. The Errorless group 
averaged 98% correct responses (range: 96%- 
100%) and the Trial-and-Error group averaged 
96% (range: 94%-98%). A repeated-measures 
ANOVA with sample duration and group as 
factors found no significant effect of group, 
T(l,9) = 4.40, sample duration, F( 1,9) = 5.01, 
or their interaction, F( 1,9) = 0.11 (all p’s >.05, 
although the differences between groups 
almost reached significance, p = .06). In short, 
at the end of baseline training, the two groups 
were similarly accurate. 


Stimulus Generalization Test 


Figures 4 and 5 show the individual psycho¬ 
metric functions obtained during the stimulus 
generalization test for the Errorless and Trial- 
and-Error groups, respectively. Note that dur¬ 
ing the stimulus generalization test half of the 
trials were training trials (2-s and 10-s samples) 
and the other half were test trials (2.99-s, 4.47-s 
and 6.69-s samples). Data from both training 
and test trials are shown in Figures 4 and 5. For 
all birds, the probability of a “long” response, 
/?(“long”), increased monotonically with sam¬ 
ple duration. To describe the results concisely 
and estimate the bisection points, a three- 
parameter logistic function was fitted to the 
data from individual pigeons: 


P(“short”|0 = 



( 1 ) 


where t is the sample duration, /(is the mean of 
the function (an estimate of the bisection 
point), and a is the standard deviation. Overall 
Equation 1 described the data well, accounting 
for 97.5% to 100% of the variance (VAC) in the 
individual data (M = 99.3%). Average values of 
the bisection point (/<) were 4.46 s (SEM = 
0.52) for the Errorless group and 4.34 s (SEM 
= 0.51) for the Trial-and-Error group. In both 
cases, average bisection points were closer to 
the geometric mean (4.47) than the arithmetic 
mean (6) of the two sample durations. It 
should be noted that for 3 pigeons (2 from 


the Errorless group, 201 and 208, and 1 from 
the Trial-and-Error group, 204) the accuracy 
at the longest training stimuli was relatively 
low. 

Figure 6 shows that the average psychomet¬ 
ric functions did not differ between the two 
groups. An ANOVA with group and sample 
duration as factors found a significant effect of 
duration, P(4,36) = 71.62, p < .001, but the 
effect of group and the group x sample 
interaction was not significant. Furthermore, 
there were no significant differences between 
the groups in terms of the parameters of the 
logistic fits and VAC (for all t tests, p > .05). 

As mentioned before, we attempted to 
predict the bisection points for the Errorless 
group on the basis of the mediating behavioral 
pattern learned during the fading procedure. 
The distribution of the dark key pecks on the 
10-s trials revealed a well-differentiated, highly 
stereotypical motion pattern for all birds: At 
trial onset, they pecked the left key (corre¬ 
sponding to the 2-s trials); after a few seconds, 
they switched to the right key (corresponding 
to the 10-s trials) and pecked this key until the 
end of the trial. The moment of switching 
should predict the bisection point. 

To determine whether this was the case, we 
calculated the proportion of pecks to the right 
key as a function of sample duration for all 10-s 
trials during the last five sessions of Phase II 
(100% fading). Equation 1 was fitted to those 
data and accounted for a very high percentage 
of the variance in the individual data (M = 
99.7%; range: 99.6% to 100%). Figure 7 shows 
the individual and average data. The average 
value of the bisection point (/() was 4.42 s 
(SEM = 0.20). A t test showed that there was 
no significant difference between these results 
and those obtained from the stimulus gener¬ 
alization test for the Errorless group (ji = 
4.46 s), t( 4) = —0.07, p = .95, suggesting that 
the hypothesis that the time when pigeons 
switched from pecking the left key to pecking 
the right key during the sample duration 
(when they were not illuminated) can predict 
the bisection points during the stimulus 
generalization test. In addition, we correlated 
for each pigeon the psychometric function 
obtained from the distribution of dark key 
pecks on the 10-s trials (Figure 7) and the 
psychometric function from the stimulus 
generalization test for the Errorless group 
(Figure 4). Results showed that correlations 
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Bird 201 Bird 202 




Bird 20 8 



Bird 227 



Bird 228 



Fig. 4. Preference for the right green key, the key associated with 10-s sample, for the Errorless group. Filled circles 
correspond to the data from the stimulus generalization test and the curves through the data points are the best-fitting 
logistic curves (see Equation 1). The psychometric functions combine data from training trials (2-s and 10-s samples) and 
test trials (2.99-s, 4.47-s and 6.69-s samples). 


were highly positive for most pigeons (M = 
0.88; Pigeons 201, 202, 208, 227 and 228: 1.00, 
0.98, 0.85, 0.69 and 0.90, respectively), con¬ 
firming our hypothesis. 


Delay Test 

Figures 8 and 9 show the proportions of 
correct responses on 2-s and 10-s trials during 
the delay test. For all pigeons from the Trial- 
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Bird 203 




Bird 226 



Bird 204 



Bird 206 



Bird 226 



Sample duration (s) 

Fig. 5. Preference for the right green key, the key associated with 10-s sample, for the Trial-and-Error group. Filled 
circles correspond to the data from stimulus generalization test and the curves through the data points are the best-fitting 
logistic curves (see Equation 1). As in Figure 4, the psychometric functions combine data from training trials (2-s and 10-s 
samples) and test trials (2.99-s, 4.47-s and 6.69-s samples). 


and-Error group, accuracy decreased with the 
delay interval. For 5 pigeons, accuracy was 
differentially degraded by delay on the 10-s 
sample trials, but for Pigeon 204 accuracy 
decreased more on 2-s than 10-s trials. Two 


subjects from the Errorless group (Pigeons 201 
and 208) performed similarly to the majority 
of the subjects from the Trial-and-Error group, 
and 1 bird (Pigeon 202) performed the same 
as Pigeon 204. Interestingly, on both types of 
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Fig. 6. Average data from the stimulus generalization 
test (symbols) and average of the individual fitted curves 
(lines) for the Errorless and Trial-and-Error groups. 


trials, Pigeons 225 and 228 from the Errorless 
group showed almost no reduction in accuracy 
with delay. 

An ANOVA with group, sample duration, 
and delay as factors found a significant effect 
of delay, F (4,36) = 22.62, p < .001, 

confirming that as the delay duration in¬ 
creased, the proportion of correct responses 
decreased for both groups. There was also a 
significant interaction between sample dura¬ 
tion and delay, F (4,36) = 3.20, p < .05. This 
interaction indicates that correct responses 
decreased with delay more on the 10-s trials 
than the 2-s trials. There was also a significant 
three-way interaction, F (4,36) = 2.92, p < .05, 
because the just stated two-way interaction 
happened only in the Trial-and-Error group. 
That is, the Errorless group showed no 
significant difference between the perfor¬ 
mances on both trials. 

These results suggest that the errorless 
training made pigeons’ performance less 
sensitive to delay on 10-s sample trials. They 
are consistent with the predictions based on 
the behavioral pattern of the Errorless group: 
The pigeons were pretrained to peck at the left 
key at the onset of the sample and then, after a 
few seconds had elapsed, to switch and peck at 
the right key until reinforcement occured. 
Therefore, when a delay was included between 
the offset of the sample and the onset of the 
choice keys, we expected those pigeons to 
stand in front of the right key for longer than 
the pigeons from the Trial-and-Error group, 
which were not trained to emit this mediating 
behavior. 


Reversal Test 

Figure 10 shows the number of errors made 
by each pigeon during the acquisition of the 
new discrimination. Whereas the Errorless 
group made an average of 173 errors before 
reaching the criterion (range: 104-234), the 
Trial-and-Error group made an average of 108 
errors (range: 63-183). The individual data 
show that the pigeons for the Trial-and-Error 
group generally made fewer errors, but the 
difference between the group averages was not 
significant, t( 9) = 2.13, p > .05. 

DISCUSSION 

The main goal of the present study was to 
investigate whether a fading procedure could 
be used to teach a conditional discrimination 
with few or no errors. The fading procedure 
successfully established the intended mediat¬ 
ing behavioral pattern: At the onset of the 
sample, the pigeons started to peck the 2-s key, 
with the response rate to this key attaining a 
maximum at 2 s or 3 s into the sample, and 
then they switched to the 10-s key, with the 
response rate to this key increasing for the 
remainder of the sample duration. Clare and 
Jones (2008) mentioned that the early animal 
experiments on errorless learning dealt only 
with very simple motor behaviors and that it 
would be difficult to imagine how complex 
behaviors could possibly be established using 
an errorless procedure. Our results showed 
that contrary to Clare and Jones’ conclusion, it 
is possible to establish a complex behavioral 
pattern through an errorless learning proce¬ 
dure. 

During the gradual introduction of the S—, 
the Errorless group averaged only one re¬ 
sponse to the S—, with 2 out of 5 pigeons 
making no errors at all. These results were 
similar to or better than those obtained in 
previous reports of successful errorless simple 
discriminations (e.g., Terrace, 1963a). 

Comparisons of the Errorless group and the 
Trial-and-Error group on all discrimination 
trials before criterion was reached revealed a 
significant difference between the average 
number of errors made by each group (4 
and 52, respectively). Notably, one pigeon 
from the Errorless group made 0 errors during 
all discrimination trials and another made only 
1 error. These results show that the fading 
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Fig. 7. Proportion of pecks to the right key as a function of sample duration for all 10-s trials during the last five 
sessions of Phase II (100% fading), for the Errorless group. 


procedure was effecdve in teaching the error¬ 
less conditional discrimination, with all the 
pigeons learning the temporal discrimination 
with few or no errors. Ferster and Hammer 
(1966) speculated whether a fading procedure 
could be devised with complex stimuli to 
produce stimulus control without S— respond¬ 


ing. Our study is the first to show that a fading 
procedure can be used to teach a complex, 
conditional discrimination with few or no 
responses to the S—. Because one of the 
reasons for the decreased interest in errorless 
learning by basic researchers was that it 
applied only to a restricted range of discrim- 
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Bird 201 Bird 202 



Delay duration (s) 

Fig. 8. Proportion of correct responses during the delay test for the Errorless group. Solid and dotted lines 
correspond to the 2-s and 10-s trials, respectively. 


ination procedures, the extension of the 
fading procedure to a conditional discrimina¬ 
tion is particularly noteworthy. 

Rilling (1977) claimed that the early intro¬ 
duction of the S— during the discrimination 
training is the most powerful procedural 
variable in the errorless learning procedures. 
However, in our experiment the S— corre¬ 
sponding to each S+ was introduced only after 
an average of 46 S+ only sessions, showing that, 


contrary to Rilling’s claim, the moment the S— 
is introduced may not be fundamental to 
learning a conditional discrimination without 
errors. 

We hypothesized that if the mediating 
behavior established by the fading procedure 
remained stable, the time of switching from 
responding to the left key to responding to the 
right key should predict the bisection point of 
the stimulus generalization gradient. Results 
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Fig. 9. Proportion of correct responses during the delay test for the Trial-and-Error group. Solid and dotted lines 
correspond to the 2-s and 10-s trials, respectively. 


are consistent with this hypothesis, showing 
that (a) the average bisection point predicted 
from the pigeons’ behavior and that obtained 
during the stimulus generalization test were 
similar (Ms = 4.42 and 4.46), and (b) the 
behavioral pattern developed during pretrain¬ 
ing and the pigeons’ performance during the 
stimulus generalization test showed high pos¬ 
itive correlation for all pigeons (M = .88). 
Machado and Keen (2003) have also com¬ 
pared the pigeons’ behavioral patterns and 
their performance during a stimulus general¬ 
ization test. In their experiment, they used a 
long chamber equipped with one key and 


feeder at each end, and with sensitive floor 
panels to record the subject’s location. The 
pigeons had to choose one side key after a 
short sample and the other side key after a 
long sample. At the end of training, the 
pigeons started each trial by moving to the 
“short” side; they stayed there for a while, and 
then, if the trial continued, moved to the 
“long” side. For 4 out of 7 cases, the 
cumulative distribution of departure times 
(i.e., the time the pigeons left the “short” 
side) fitted well the psychometric function 
obtained from the stimulus generalization test. 
Thus, pigeons might display behavior during 
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Fig. 10. Number of responses to the S— made by each bird until they reached the learning criterion during the 
reversal test. 


trials that suggests a kind of mediation of the 
temporal discrimination that could function 
similarly to the explicitly trained mediating 
behavior in the Errorless birds of the current 
study. This may account in part for the fact 
that, overall, results from the stimulus gener¬ 
alization test were similar for the Errorless and 
Trial-and-Error groups. 

We also hypothesized that the performance 
of the Errorless group should be more 
resistant to disruption because their mediating 
behavior was explicitly trained. Because after a 
10-s sample these pigeons should be standing 
in front of the right key, we predicted that 
when delays between the offset of the sample 
and the onset of the comparison stimuli were 
introduced (delay test), the accuracy levels 
should be higher than for the Trial-and-Error 
group. Although the Trial-and-Error birds may 
also have adopted, untrained by us, a kind of 
behavior that mediated their timing perfor¬ 
mance, explicit shaping and reinforcement in 
the Errorless birds established a robust and 
effective mediating performance that with¬ 
stood the disruptive effects of the delays. 
Despite variability across subjects, results sug¬ 
gested that for the Errorless group, respond¬ 
ing was overall less disrupted by the delay than 
responding for the Trial-and-Error group. In 
some sense, then, it may have been the greater 
resistance of the trained mediating behavior of 


the Errorless birds relative to the untrained 
mediating behavior of the Trial-and-Error 
birds that accounted for the difference in 
disruption during the delay tests. 

Results from the reversal test showed that all 
pigeons learned the new discrimination, and 
the two groups made similar numbers of errors 
throughout its acquisition. Together with 
previous experiments in which researchers 
successfully reversed the contingences follow¬ 
ing a discrimination learned without errors 
(Aronsohn et al., 1978; Robinson & Storm, 
1978; Schusterman, 1965, 1966; Stettner & 
Matyniak, 1969), our results challenge Rilling’s 
claim that errorless learning may not be the 
“best learning for an organism exposed to a 
changing environment” (Rilling, 1977, p. 475). 

The results of our study may have implica¬ 
tions for theories of timing, theories of 
learning and applied psychology. We consider 
each in turn. Some theories of timing do not 
assume that errors (i.e., nonreinforced re¬ 
sponses) are important for discrimination 
learning, as is the case of the popular Scalar 
Expectancy Theory (SET; Gibbon, 1977). 
However, other accounts such as the Learn- 
ing-to-Time model (LeT; Machado, 1997) 
postulate that both reinforced and nonrein¬ 
forced responses contribute to the acquisition 
of a discrimination. Previous research using a 
temporal double bisection task (e.g., Arantes, 
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2008; Arantes & Machado, 2008; Machado & 
Arantes, 2006), in which subjects are trained 
on two temporal conditional discriminations 
and later given a transfer test in which stimuli 
from the two discriminations are paired 
together, have provided some evidence in 
favor of LeT’s assumption that responses to 
both S+ and S— are important. Future 
research should further investigate the role 
of the errors by using a fading procedure 
similar to that used in the present study to 
reduce or eliminate the errors made in a 
temporal double bisection task. If the errors 
are important for the discrimination learning, 
then these animals should perform differently 
than the animals that learn with a large 
number of errors. However, if the errors are 
not important, the animals’ performance 
should be the same regardless of the number 
of errors. 

Our results are important also for theories 
of learning. For example, they challenge the 
classical generalization theory (e.g., Spence, 
1936, 1937; Hull, 1943). Specifically, because 
the S+ and S— differed only with respect to 
wavelength, were presented simultaneously, 
and the response to S+ had a long history of 
reinforcement before the S— was introduced 
(M = 46 sessions), the classical generalization 
theory would predict a generalization from the 
S+ to the S— and therefore that the pigeons 
would initially make some responses to the S— 
during choice trials. However, as we men¬ 
tioned before, the pigeons did not generalize 
from the S+ to the S—, giving few or no 
responses to the latter. 

One of the reasons why errorless learning 
attracted so much attention was because the 
S— did not seem to work as a conditioned 
inhibitor, but rather as a neutral stimulus 
(Terrace 1966b, 1967, 1971). A conditioned 
inhibitor is “a stimulus that develops during 
conditioning the capacity to decrease response 
strength below the level occurring when that 
stimulus is absent’” (Hearst, Besley & Far¬ 
thing, 1970). Proposed by Pavlov (1927), the 
concept of conditioned inhibition has attract¬ 
ed the interest of several researchers (e.g., 
Hearst et al., 1970; Jenkins, 1965; Rescorla; 
1969; Terrace, 1964, 1966a, 1972) and played 
an important role in models of discrimination 
learning (Hull, 1943, 1952; Spence, 1936, 
1937). Several researchers have criticized 
Terrace’s conclusion that the S— did not 


function as an inhibitor (e.g., Wessels, 1973; 
Wilkie & Ramer, 1974; see Rilling, 1977, for a 
review). 

Our experiment sheds some light on this 
issue. In previous experiments on errorless 
learning the S+ (e.g., red key) was always 
associated with reinforcement while the S— 
(e.g., green key) was always associated with 
nonreinforcement. In our conditional discrim¬ 
ination study, the red key and the green keys 
were associated with reinforcement and non¬ 
reinforcement (depending on the preceding 
sample duration), and both keys were present¬ 
ed simultaneously during the discrimination 
trials. During approximately the first 46 
sessions, the pigeons were presented exclusive¬ 
ly with S+ trials, responding to the keys with a 
probability close to 100%. When the corre¬ 
sponding S— stimuli were introduced, the 
probability with which a response occurred 
did not decrease, that is, the presence of the 
S— did not seem to reduce the response 
strength to the S+. These results are consistent 
with Terrace’s (1966b, 1967, 1971) assump¬ 
tions that the errorless S— is not an inhibitor. 

In summary, we showed that the errorless 
training is effective in reducing and even 
eliminating completely errors during the 
acquisition of a conditional temporal discrim¬ 
ination. Also, when exposed to stimulus 
generalization, delay, and reversal tests, the 
results from the Errorless group were similar 
to those from the Trial-and-Error group, 
although there was some indication that the 
Errorless group was less disrupted by delays. 
These results challenge some of the previous 
criticisms of errorless learning and suggest 
that the abandonment of this topic by basic 
researchers may have been premature. In fact, 
when first discovered, errorless learning 
attracted much attention because it was 
considered anomalous. Contrary to the pre¬ 
vailing view on discrimination, which as¬ 
sumed that associations with both reward 
and nonreward are necessary, in a simple 
errorless training procedure, one stimulus is 
associated with reward but the other remains 
neutral, for it is rarely or never chosen. In our 
task, both stimuli are associated with reward, 
albeit in different contexts, and yet the 
discrimination is learned without errors. 
The processes underlying errorless condition¬ 
al discrimination learning remain to be 
elucidated. 
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